Revisit Long Short-Term Memory: An Optimization Perspective

نویسندگان

  • Qi Lyu
  • Jun Zhu
چکیده

Long Short-Term Memory (LSTM) is a deep recurrent neural network architecture with high computational complexity. Contrary to the standard practice to train LSTM online with stochastic gradient descent (SGD) methods, we propose a matrix-based batch learning method for LSTM with full Backpropagation Through Time (BPTT). We further solve the state drifting issues as well as improving the overall performance for LSTM using revised activation functions for gates. With these changes, advanced optimization algorithms are applied to LSTM with long time dependency for the first time and show great advantages over SGD methods. We further demonstrate that large-scale LSTM training can be greatly accelerated with parallel computation architectures like CUDA and MapReduce.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

The effect of intrahippocampal microinjection of Naloxone on short –term and long-term memory in adult male rats

Introduction:The hippocampus is one for the major centers of learning and memory. Role of the opioid system has been investigated and on the other hand receptors related to this system such as mu-opioid receptors (MOR) are extended in the hippocampus. In this study the effect of Naloxone administration as a mu opioid receptor antagonist on passive avoidance memory in adult male rats was i...

متن کامل

Effects of exercise on spatial memory deficits induced by nucleus basalis magnocellularis lesions

Introduction: Previous studies have shown that exercise enhances cognitive and functional capacities in patients with Alzheimer's disease (AD). In this study, we investigated the effect of long-term (60 days) and short- term (10 days) exercise on the spatial memory deficits in an animal model of AD. Methods: Fifty male rats were divided into 5 groups 1) intact, 2) sham, 3) sham-Alzheimer 4) ...

متن کامل

P7: The Roles of Long-Term Memory on the Organization of the Knowledge for Educators

Modern neuroscientific research help to solve the impotent challenge in curriculum design and teaching for enhancing students’ ability to organize information in a way that makes it efficient in response to an appropriate context such as problem solving and critical thinking via knowing about the mechanism of different type of memories especially long term memory. At first, we should to c...

متن کامل

Effects of fresh, aged and cooked garlic extracts on short- and long-term memory in diabetic rats

Objective: The present study was hypothesized to investigate the beneficial effects of fresh, aged, and cooked garlic extracts on blood glucose and memory of diabetic rats induced by streptozocine (STZ). Material and Methods: Diabetes was induced by an intraperitoneal injection of STZ (60 mg/kg body weight). An oral dose of 1000 mg/kg of each garlic extract was given daily for 4 weeks after dia...

متن کامل

Prediction of Covid-19 Prevalence and Fatality Rates in Iran Using Long Short-Term Memory Neural Network

Introduction: The rapid spread of COVID-19 has become a critical threat to the world. So far, millions of people worldwide have been infected with the disease. The Covid-19 pandemic has had significant effects on various aspects of human life. Currently, prediction of the virus's spread is essential in order to be safe and make necessary arrangements. It can help control the rate of its outbrea...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2014